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ABSTRACT 

In this work we present an application of general extreme value statistics (GEV) to very mas- 
sive single clusters at high and low redshifts. After introducing the formalism, we apply this 
statistics to four very massive high redshift clusters. Those clusters comprise ACT-CL J0102- 
4915 with a mass of Mzoom = (2.16 + 0.32) x 10'^ Mq at aredshift of z = 0.87, SPT-CL J2106- 
5844 with a mass of Maoom = (1-27 ± 0.21) x 10'^ Mq at z = 1.132 and two clusters found 
by the XMM-Newton Distant Cluster Project survey: XMMU J2235. 32557 with a mass of 
M200C = (7.3 + 1 .3) X lO'"* Mo located at a redshift z = 1 .4 and XMMU J0044.0-2033 having a 
mass in the range of M200C = (3.5 - 5.0) x lO''* Mq at z = 1.579. By relating those systems to 
their corresponding distribution functions of being the most massive system in a given survey 
area and redshift interval, we find that none of the systems alone is in tension with A cold 
dark matter (ACDM). We confront these results with a GEV analysis of four very massive 
low redshift clusters: A2163, A370, RXJ1347-1145 and 1E0657-558, finding no tendency of 
the high-z systems to be more extreme than the low-z ones. 

In addition, we study the extreme quantiles of single clusters at high-z and present contour 
plots for fixed quantiles in the mass vs. survey area plane for four redshift intervals, finding 
that, in order to be significantly in conflict with ACDM, cluster masses would have to be 
substantially higher than the currently observed ones. 

Key words: methods: statistical - galaxies: clusters: individual (ACT- CL J0102-4915, SPT- 
CL J2106-5844, XMMU J0044.0-2033) - galaxies: clusters: general - cosmology: observa- 
tions 



1 INTRODUCTION 

The discovery of the high-redshift cluster XMMU J2235. 32557 
at z = 1.4 by Mullis e t al. (2005) and i t s following Join t X-ray 
and lensing analysis 1 Rosati et aU l2009l : |jee et alj |2009|) . which 
established this system as the most massive cluster at redshift z > 1 
at the time, have motivated a number of studies about the usability 
of very massive galaxy clusters a t high redshift s as a c osmological 
probe JHoIz & Perlmut"te3 1201 d: iHovle et ah 1201 ll; iMantz et af 



20081 12OIOI : iMortonson et alj|201 linHotchkissll2oTTI : iHovle et al 
201 IbO . The majority of those mainly focus on consistency-checks 



of the A cold dark matter (ACDM) concordance model, some dis- 
cuss alt ernatives to the standard model in the f orm of coupled dark 
energy dSaldi &Pettorindl201ll:lBaldill201lh or non-Gaussianity 



jCavon et alJ I2OIII: Enqvistet alJ 1201 ll : iParaniape et all l201li : 
IChongchitnan & Silkl201l h. 

In parallel to these developments, the application of extreme value 
theory to high-mass clusters at high redshifts became increasingly 
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popular. IPavis et aL V2OII') utilised general extreme value st atistics 
(GEV ) (see e.g. ,Gumbel ( 1958); Kotz & Nadarajah (,2000t); IColej 
( l200lh ) in order to study the probability di stribution of the mos t 
massive halo in a given volume. In addition. Icolombi et al.l ( l2oTlh 
applied GEV to the statistics of Gaus si an ran dom fields. Based 
on this groundwork, IWaizmarm et alj ( 1201 ih proposed to use 
GEV for reconstructing the cumulative distribution function of 
massive clusters from high-z cluster surveys and to use it as a 
discrimina nt between di fferent cosmological models. Recently, 
[Uar nson & ColesI ( l201lh also studied extreme value statistics 
based on the exact form rather than the asymptotic one. Other 
applications in the framework of astrophysics are the study of the 
statist ics of the brightest cluster galaxies by iBhavsar & Barrowl 
( Il985l) and the application to temperature maxima in the cosmic 
microwave background by , Coles. Cl988i) . 



In this work we present an application of extreme value 
statistics to individual massive clusters at high and low redshifts by 
computing the probability distribution functions for these clusters. 
We study the probability to find such clusters as the most massive 
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systems also in ill defined survey areas and compare our results 
to the findings of other works. Furthermore, we utilise GEV for 
the computation of extreme quantiles in order to quickly relate an 
arbitrarily observed cluster to its probability of occurrence. 

This paper is structured according to the following scheme. In 
Section |2l we briefly int roduce the applica tion of GEV to massive 
clusters as discussed bv lDavis et al.l ( 12011") . An introduction of the 
clusters studied in this work follows in Section [3] and it is divided 
into subsections for the high-z and low-z systems. We apply GEV 
to the chosen objects in Section |4l in Section 1431 we discuss the 
bias arising from a posteriori choice of the redshift intervals for the 
analysis and in Section 1421 how the observed cluster mass has to 
be corrected for a subsequent statistical analysis. This analysis is 
then performed in Section |43l for the four most massive clusters in 
four a priori defined redshift intervals. In Section [5] we study the 
impact of the survey area on the existence probabilities and com- 
pare the low with the high-z clusters. In the following Section (6] 
we introduce the extreme quantiles based on GEV and compute the 
iso-quantile contours for four redshift intervals in the mass-survey 
area plane. After this, we summarise our findings in the conclusions 
in Sectionl?] 



2 GEV STATISTICS IN A COSMOLOGICAL CONTEXT 

Extreme value theory (for an introduction see e.g. lGumbell ( Il958h : 
iKotz & Nadaraiah' ('2000'); IColesI j200lh ) is concerned with the 
stochastic behaviour of the maxima or minima of i.i.d. random vari- 
ables Xj. In what follows we will only consider the first case, intro- 
ducing the block maximum M„ defined as 

M„ =max(Xi,...X„). (1) 

It has been shown ^Fisher & TiDpetllll928l : lGnedenkdll943h that, 
for n — > oo, the limiting cumulative distribution function (CDF) 
of the renormalised block maxima is given by one of the extreme 
value families: Gumbel (Type I), Frechet (Ty pe II) or Wei bull (Type 
III). A s independently shown by Ivon MisesI ( il954l) and ljenkinsod 
( Il955i) . these three families can be unified as a general extreme 
value distribution (GEV) 



Gy,/},<r(x) : 



-p{-[i-r(T)rl 



exp 



{e-(^)}. 



for 7^0, 
for 7 = 0, 



(2) 



with the shape-, scale- and location parameters y, fi and a. In this 
generalisation, 7 = corresponds to the Type 1, 7 > to Type II and 
7 < to the Type III distributions. The corresponding probability 
density function (PDF) is given by 

dGy,/j.„(x) 



dx 



(3) 



From now on we will adopt the convention that capital initial 
letters denote the CDF (like Gy_/3,a,(x)) and small initial letters 
denote the PDF (like gy,p,a{x)). 

A formalism for the application of GEV on the most m assive 
galaxy clusters has been introduced by iDavis et al] j20I ll) and is 
briefly summarised in the following. By introducing the random 
variable u = log|o(m), the CDF of the most massive halo reads 



Pr{M„ 



(4) 
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Jo 

This probability has to be equal to the one of finding no halo with 



Cluster 




Figure 1. Illustrative scheme for the application of GEV on a single cluster 
observed at Zoha in order to study its probability of existence in a given 
volume V. The volume is defined by the survey area. A,, and the redshift 
interval, z e [ziow 



a mass larger than u. On scales (> lOOMpc/i"'), for which the 
clustering between galaxy clusters can be neglected, the CDF is 

iven by the Pois son distribution for the case of zero occurence 

Davis etalj20Ill) : 



A* exp (-X) 
k\ 



: exp [-neff(> ii)V} , 



(5) 



where neff(> ") is the effective comoving number density of halos 
above mass u = log,Q(m) obtained by averaging and V is the co- 
moving volume. By assuming that equation (|4j can be modelled by 
Gy^p^a(u), it is possible to relate the GEV parameters to cosmologi- 
cal quantities by Taylor-expanding both Gy^fj^ai") and Po(u) around 
the peaks of the corresponding PDFs: 



Po(u) = Po(uo) + 



dPoiu) 



du 



(u -uo) + . 



Gy,/J,ff(M) - Gy,l3^„{U(i) + 



dGy^/i,a(u) 



d u 



(u - Uq) + . 



By comparing t he individual first two expansion terms with each 
other, one finds dDavisetalJbOllI) 



J = n^ff{> mo)V - I, P = 



(1 + 7)"+''' 
d"i.-(r I 

d m l/jiQ 



Vmo In 10 



■ logio'"o - -[(1 +7) 

y 



1], 



(6) 



where mo is the most likely maximum mass and dneff/dm|,„|j is 
the effective mass function evaluated at mo, which relates to the 
effective number density ;icff(> m) via 



dm 



dicff(> in) 



dm 



(7) 



The most likel' 
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likely ma ss, mo, can be found dDavis et al.l I2OIII: 
et alj20ll1) by performing a root search on 



dm 



+ mo 



d «cff 



dm^ 



-I- raoV 



dm 



(8) 



For calcul ating n^^ w e utilised the mass function intro- 
duced by [Tinker et alj ( l2008h and fix the cosmology to 
(/i,nAO,f2„o,a-8) = (0.7,0.73,0.27,0.81) based on the mikin- 
son M icrowave Anisotropy 7-yr ( WMAP7) results dKomatsu et al.l 
I2OIII) . 
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Figure 2. PDFs (left-hand column) and CDFs (right-hand column) for the four most massive clusters in the redshift bins given in Tab. |2] For all four clusters 
the effect of the bias discussed in Section|4]is shown, where the red, dash-dotted lines are the distributions assuming the redshift interval z e [Zobs. Zup] and the 
black, solid lines for the z-interval given in the key (left-hand panels). The Eddington bias coiTected position of the system on the distributions is given by the 
filled black (red) circle for both cases, illustrating the impact of the aforementioned bias. The grey (red) shaded regions represent the regions of uncertainty in 
the mass measurements and the central line in these regions refers to the reported mass of the cluster. The dashed lines denote the most likely expected mass 
of the most massive system for the given survey area and redshift interval. 



4 J.-C. Waizniann, S. Ettori and L. Moscardini 



Table 1. Compilation of the data of the galaxy clusters studied in this work. The masses M200C and M2oom are with respect to the critical and the mean 
background density. The mass M^^^^ gives the mass after the correction for the Eddington bias based on the estimated mass uncertainty o"i„m. The last 
column lists the references for the observed mass (either M200C. or Maoora), on which the analysis is based on. 



Cluster 


z 


M200C 


in units of M© 


'^20011 


in units of M© 


0"lnM 


ACT-CL JO 102 


0.87 








(2.16 
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0.32) 


X 


10'5 


0.2 


SPT-CL J2106 


1.132 








(1.27 
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0.21) 


X 


10'5 


0.2 


XXMU J2235 
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1.3) X 10'" 
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1.38) 


X 


10'" 


0.2 


XXMU J0044 


1.579 


(4.25 


± 


0.75) X lO'"* 


(4.46 


± 


0.79) 


X 


10'" 


0.3 
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0.203 


(2.7 


± 


0.6) X lO'"' 


(3.68 
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0.82) 


X 


1015 


0.25 
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10'5 
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Marriage et al. (2011) 
Foley etal. (2011) 
Jee et al. (2009) 
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Maughan et al. (20ir) 
Maughan et al. (2011) 
WilUa mson et al. (2011) 
Umetsu et al, (2011) 
Maughan et al. (20ir) 



As illustrated in Fig. \T\ the volume V is determined by the sur- 
vey area and the redshift interval z 6 [ziow.Zup], where ziow and Zup 
are the lower and upper boundary, respectively, which contain the 
individual cluster at the observed redshift Zobs- The choice of the in- 
dividual redshift intervals is discussed in further detail in Section|4] 



3 THE STUDIED CLUSTERS 

In order to demonstrate the usability of GEV for the study of very 
massive clusters at high redshifts, we decided to apply the method 
to several observed clusters known to be rather massive given their 
detection redshift. We divide our analysis into two parts: in the first 
one, we consider four high-z systems and in the second one four 
systems in the low redshift Universe. 



3.1 The high-z systems 

As high redshift systems, we chose the following four objects: 



ACT -CL J0102-4915: This recenfly discovered jMarriage et"al] 
I2OI if ) merging s ystem is currently the most massive cluster ob- 
served at z > 0.6 l lMenanteau et alj201 ih and has been detect ed by 
the Atacama Cosmology Telescope (ACT) dFowler et al J 20071) in its 
755 deg- field. Its mass has been determined by a combination of 
Sunyaev-Zeldovich (SZ) dSunvaev & Zeldoviclj[l97lll980h . opti- 
cal (Very Large Telescope), X-ray (Chandra), and infrared (Spitzer) 
data to be M2oom = (2.16 ± 0.32) x 10'^ Mq at a spectroscopic red- 
shift of z = 0.87. Si nce the survey areas o f ACT and the South 
Pole Telescope (SPT) dCarlstrom et'ai]|2oT ll) overlap and this clus- 
ter lies in the overlap region, we conservatively decided to assign 
the combined survey area of 2 800 deg^ to this system. 

SPT-CL J2106-5844: This recently reported cluster jpolev et al.1 
I2OIII : IWilliamsonetal.ll201lh has been detected by the SPT sur- 
vey in its 2 500 deg^ field. The mass inferred from a combination 
of SZ and X-ray information is found to be A/2oom = (1.27±0.21)x 
10'^ Mq and spectroscopy of the member galaxies locates this clus- 
ter at a redshift of z = 1.132. These observed parameters make this 
cluster the most massive cluster at redshift z > 1 and thus a highly 
interesting system to study. 

XMMU J2235.32557: This object has been f ound by the XMM - 
Newton Distant Cluster Project (XDCP) survey dMullis et al.l2005l) 
in an area of 1 1 deg^. By com bining X-ray, weak tensing and veloc- 
ity dispersion m easurements jMullis et alj2005l : lRosati et alj200^ : 
Ijee et aljlioosl) the mass of this cluster has been estimated to be 



M200C = (7.3 ± 1.3) X 10" Mq and it is located at a redshift z=\A. 
XMMU J2235 was the trigger of many studies on the possibility of 
using massive clusters at high redshifts as cosmological probes as 
already summarised in the Introduction. 
XMMU J0044.0-2033: Like XMMU J 2235, this clitster h as re- 
cently been detected by the XDCP survey (' Santos etal J2OIII) m an 
area of ~ SOdeg^. The cluster is located at z = 1.579 and its mass 
was found to be in the range of M200C = (3.5 - 5.0) x 10" Mq, which 
puts this cluster into the forefront of currently known high-mass 
and high-z clusters. For the current work we adopted the central 
value M200C = (4.25 ± 0.75) x 10" Mq as the mass for this system. 

3.2 The low-z systems 

The chosen objects in the low redshift Universe comprise: 

Abell 2163: The merg ing cluster A2163 i s one of the hottest X- 
ray clusters on the sky jAmaud et al.lll992h . At a redshift of z = 
0.203, this cluster has a mass of M200C = (2.7 ± 0.6) x 10'^ aii d 
has been studie d at mu l tiple wavel engths [see Squires et aljl 1997h : 
iRadovich et al] ( l2008l) : lNord et all ( 120091) : IOkabe et al.l ( I2OI ll) . for 
instance], 

Abell 370: While RXI1347 is known to be the most luminous X- 
ray cluster on the sky, A370 is known to be the most massive lens- 
ing cluster (Broadhurst et al. 2008; Broadhurst & Barkana 20CI^ 
with a mass of M200C = (2.21 ± 0.27) x 10'^ Mq dUmetsu et"al1 
l201lh at a redshift of z = 0.375. 

RXJ1347-1145: This object has been found by the ROSAT All 
Sky-Sun'ey (RASS) and is known to be the most luminous X-ray 
clu ster on the sk y. It has been studied in the optical and X-ray (see 
e.g. lSchindleret al. ( 1995, 1997); Ettori et al. (2001)) as well as via 
the SZ dKomatsu et al..J999, ; .Pointecouteau et al.iJ999i) . The mass 
of this cluster has been found to be M200C = (2.1 ± 0.5) x 10'^ Mq 
and it is located at a redshift z = 0.45 1 . 

1E 0657-558 : Widel y known as the "Bu llet-Cluster" 

(Tuc ker et alj 1 19981 : iMarkevitch eTal] I2OO2I ; IClowe et al.l 
1200^ this dynamically very interesting system is with 
M2oom = (3.12 ± 1.15) X IO'^mJ] at z = 0.296 the mos t 
massive cluster in the SPT survey field ( Williamson et al. I l201lh 
because of which it has been added to this study. In addition to the 
SZ based mass from above, we added also the X-ray mass to our 
study as discussed below. 



* The statistical and systematic errors from Tab. 6 of IWilliamson et al] 
i201 ll) have been added in quadrature. 
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The X-ray masses for A2163, RXJ1347 and 1E0 657-558 have 
been e stimated through the M - T scahng relation in lAmaud et alj 
ll2005h (Tab. 2), by adopting th e X-ray temperatu re estimated in the 
(0.151)i?5oo aperture from Maug han etal] (l201 ll): 15.2± 1.2keV for 
A2163, 14.2± 1.4keV for RXJ1347 and 11.7 ± 0.5 keV for 1E0657 
that correspond to M200C = (2.7+0.6)xlO'^Mo, (2.1±0.5)xlO'-^Mo 
and (1.75 ± 0.29) x IO'^Mq, respectively. In order to apply a GEV 
analysis to the aforementioned low-z clusters, we have to define 
the survey area. By adding the areas for the northern (NORAS) 
and southern (REFLEX) cluster samples of the RASS, one finds 
A, = 27490deg^ which we will adopt for our analysis. 
It should be mentioned that all observed masses reported in the sec- 
tion are measured with respect to the critical background density, 
apart from the ACT and SPT clusters, whereas the masses from 
theory are usually defined with respect to the mean background 
density. This issue will be discussed in more detail in the follow- 
ing section. A compilation of the data of all studied clusters can be 
found in Tab.[T] 



4 THE GEV ANALYSIS 

In order to apply a GEV analysis to the aforementioned single sys- 
tems with the goal of quantifying their probability of existence in 
a ACDM cosmology, one has has take several effects into account 
that will be discussed in the following. 



4.1 Bias due to the a posteriori choice of the redshift interval 

The most important effect that has t o be consid e red is the bias, in- 
troduced and discussed in detail in iHotchkisj ( [201 l|), that stems 
from the a posteriori definition of the redshift interval for which 
the likelihood of a cluster is calculated. When the redshift inter- 
val is defined a p osteriori (see e.g. M ortonson et al. (2011) and 
iFolev et al] ilOl ih ), one ignores the fact that the potentially rare 
cluster of interest could have easily appeared at a different redshift 
leading to a different probability of existence. In order to avoid this 
bias we decided to define a priori four redshift bins < z < 0.5, 
0.5 ^ z < 1.0, 1.0 < z < 1.5 and 1.5 ^ z < 3.0 and to calculate 
the probability distributions of the most massive cluster in these in- 
tervals for a given survey area. A,, as illustrated in Fig. [T] Then, 
we select, out of the clusters listed in Section[3l the most massive 
ones that fall in these individual bins and quantify their probability 
of existence. The effect of the bias is shown in Fig.|2]as a substan- 
tial shift of the probability distributions depending on the choice of 
the redshift interval and is discussed in more detail below in Sec- 
tion|431 



where z is the cluster's redshift and Ra = c^r^ is the radius within 
which the mean cluster overdensity is A times Pc(z)- The relation 
with the concentration ca and the scale radius fs holds by definition 
of the NEW mass profile (Navarro et al. 1997). In order to change 
the mass definition to the one of the mean background density one 
has to scale A = 200 by n^(z) = n,^(l -1- z)^/H^(z)- Due to the fact 
that Ma/(R-^^A) is constant by definition one can write 

Ma ^ M200 A 
^ c?oo 200' 

where and C200 are related through the assumed NEW mass den- 
sity profile by 

ln(l +ca)-ca/(1 -hca) _ A ^^^^ 
Ca / hi(l -I- C200) - C2oo/(l + C200) 200 ' 

Together with the c-M relation of Zhao et al. (2009), which we 
adopted for this work, one can directly scale from M200C to Mjoom- 
For the selected clusters the results are given in the third column of 
Tab.E 

Since the mass function is very steep at the high mass end, 
it is more likely that lower mass systems scatter up than higher 
mass systems scatter down, resulti ng in a systemati c shift. This ef- 
fect is known as Eddington bias ( lEddington|[l913l) and has to be 
corrected for when observed masses have to be related to the distri- 
bution functions of the most massive halo. This is done, following 
iMortonson et alj ( 1201 li) . by shifting the observed mass, M^bs, to a 
corrected mass. Mean, by 

1 

In = In Mobs + :^Ecr^^M, (12) 

where e is the local slope of the mass function (d«/dlnM oc M*^) 
and cr\nM is the uncertainty in the mass measurement. Eor the sys- 
tems listed above we inferred ctj^m from the reported observational 
limits on the mass and rounded them up, just to be on the conser- 
vative side. The adopted values for iXbiM are listed together with 
the Eddington bias corrected masses in Tab. [T] for all eight stud- 
ied clusters. The variations in (r]„M stem from the fact that some 
masses were estimated by the combination of different probes, like 
Lensing+SZ+X-ray, whereas some others are based only on a sin- 
gle probe. 

The two corrections, the change of mass definition and the Edding- 
ton bias, work in opposite directions, because the former increases 
the cluster mass and the latter lowers it. The impact of the different 
mass definition is stronger at low redshifts, since at higher redshift 
i^m(z) ~ 1, resulting in a correction of 5 - 6 per cent for the high-z 
clusters in comparison to ~ 36 per cent for A2163. Eor a statistical 
analysis it is crucial to take both effects into account or the drawn 
conclusions might be wrong. 



4.2 Preparing the observed cluster masses for the analysis 

Another obstacle to be overcome is the unification of the mass def- 
initions. All observed masses M200 stated above, apart from ACT- 
CL J0102 and SPT-CL J2106, are defined with respect to the crit- 
ical background density Pc(z) = 3H^(z)/(inG), whereas for the 
theoretical mass function the mean background density is assumed 
Pm(z) = i^m(z)Pc(z). In Order to compare the two, we have to scale 
the cluster masses to the definition used in the mass function. Eor 
an arbitrary overdensity. A, the mass is defined as 

MA=^nRlApAz), (9) 



4.3 Application to single clusters 

After having introduced all the effects that have to be taken into 
account for a sound statistical analysis of the existence probabil- 
ity of single very massive clusters, we now apply GEV to the four 
most massive clusters in the four redshift intervals summarised in 
Tab. |2] In order to demonstrate the impact of the bias discussed in 
Section im we computed the GEV distributions for both choices, a 
priori and a posteriori, of the redshift intervals. The results of these 
calculations are shown for the four clusters A2163, ACT-CL J0102, 
SPT-CL J2106 and XXMU J0044 in Eig.|2]from top to bottom. Eor 
each cluster the calculations are based on the corresponding survey 
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area ranging from = 27 490 deg" for A2163 to A, = 80 deg" for 
XXMU J0044. The survey areas are also given in each left-hand 
panel of Fig. [2] In the figures, the red, dash-dotted curves show the 
distributions for the biased case with z e [Zobs> 3.0] and the black, 
solid lines for the a priori defined redshift bins, as denoted in the 
key. It can be nicely seen in Fig.[2]how, in the biased case, the clus- 
ters move towards the tail of the distribution and appear therefore 
less likely to be found. Depending on at what redshift the cluster of 
interest resides at in the redshift bin, the effect of the bias is more or 
less pronounced, as can be seen by comparing the PDF of A2163 
with the ones of ACT-CL J0102 or SPT-CL 12106. The bias is sub- 
stantial in the sense that the existence probabilities increase and the 
allowed range due to the mass error widens, as is illustrated by the 
red and grey areas shown in the right-hand panels of Fig.|2]for the 
CDFs. The reason for the big difference in the allowed probability 
region is the steepness of the CDF of the most massive cluster in 
the given survey volume. 

For the probability to find the most massive cluster in a given bin 
to be more massive than the observed one, we obtain the follow- 
ing results: Pr{m > Mobs] = 57.2 per cent for A2163 in bin 
1, Pr{m > Mobsl = 38.1 per cent for ACT-CL J0102 in bin 2, 
Pr{m > Mobsl = 22.6 per cent for SPT-CL J2106 in bin 3 and 
Prim > Mob,] = 14.4 per cent for XXMU J0044 in bin 4. The al- 
lowed range in the existence probability due to the uncertainty in 
the cluster mass determination is listed in the rightmost column of 
Tab.H] 

This result clearly states that none of the very massive clus- 
ters reported so far can be considered to be in tension with 
ACDM, even when the upper allowed mass limit is taken. At this 
point, our findings differ sig nificantly from the results presented in 
IChongchitnan & Silkl j201lM who find that XXMU J0044 resides 
in the extreme tail of the GEV distribution, even when taking the 
full-sky as survey area. This is in contradiction with our results ac- 
cording to which, even for A, = 80deg^, we find a probability of 
existence of 14.4 per cent. In our study we neglected any overlap 
of the XDCP survey footprint with other surveys in the same area 
of the sky: taking this into account would increase the effective sur- 
vey area and thus also the probability of existence. An alternative 
argument is based on a simple calculation of the number of clus- 
ters A'(> m) more massive than in for the reported mass range and 
a redshift of z > 1.579. The results range from ~ 7-80 (or even 
30 - 250, when using the lSheth & TormenHl999h mass function as 
done by the authors), supporting our findings that this system can 
be considered as perfectly normal and expected in a ACDM Uni- 
verse. In view of this, the claim that this system is an indicator for 
non-Gaussianity on the level of /nl - 360 seems to be highly ques- 
tionabl^E 

For the GEV analysis performed in this section, we compared 
the most massive observed cluster with the theoretical distribution 
function. In doing so, it is assumed that the observed most mas- 
sive cluster is also the true most massive cluster in the volume of 
interest. In the lowest redshift bin of < z < 0.5 for instance, all 



Table 2. Selection of the most massive clusters in the four redshift intervals 
and the inferred probability of existence. 



bin 


z-intervall 


cluster 


Pr{m > Mobsl 


range 


1 


z < 0.5 


A2163 


57.2% 


(11.3 


- 99)% 


2 


0.5 <z< 1.0 


ACT-CL J0102 


38.1% 


(9.9- 


83.2)% 


3 


1.0 <z< 1.5 


SPT-CL J2106 


22.6% 


(5.2- 


66.7)% 


4 


1.5 <z<3.0 


XXMU J0044 


14.4% 


(4.2- 


39.9)% 



SPT-CL J2106-5844 



Q 

U 




A, [degl 



Figure 3. CDF as a function of survey area, Aj, with fixed observed masses, 
'^foora' SPT-CL J2106. The grey shaded areas show the uncertainty in 
the mass measurements and the black triangle denotes the SPT survey area 
of As = 2 500 deg^. The red, dash-dotted line shows the biased results based 
on a posteriori choice of the z-interval and the black, solid lines are for an a 
priori fixed interval of z e [LO, 1.5]. 



four low-z clusters from Tab.[T]exhibit potential overlap in their al- 
lowed masses, such that e.g. the true most massive cluster in the 
redshift interval could be A370 instead of A2163. However, since 
all the clusters in this redshift interval are very likely to be found in 
ACDM, the results would remain unchanged even if another clus- 
ter than A2163 was the true most massive one. For the high redshift 
intervals, the chosen systems are so extreme in their respective sur- 
vey areas that it is very unlikely that another system is the true most 
massive one. In general, one should keep in mind that the probabili- 
ties stated above are grounded on the assumption that the individual 
cluster is the true most massive one in the volume. If this is not the 
case the exact numbers from above would change, but the conclu- 
sion that none of them is in significant tension with ACDM would 
still hold. The same statement is also valid for a different a priori 
choice of the redshift intervals. 



^ The au thors include also the effect of the halo bias in their calculations. 
However. lDavis et al.l j201lh showed that the difference between the asymp- 
totic form introduced in Section[2]and the full calculation is not very laige 
on large scales (the position of the peak and the tail behaviour match rather 
well). Furthermore, this difference is certainly smaller than needed to ex- 
plain the strong discrepancy with respect to our results. 
^ We would also like to point out that a likelihood analysis based on a 
single data point and the inference of the most likely /nl from it can be 
highly misleading. 



5 COIMPARING HIGH-Z WITH LOW-Z CLUSTERS 

The existence probabilities listed in Tab. |2] seem to decrease with 
increasing redshift such that high-z clusters appear to be rarer than 
low-z ones. This could naively be interpreted as a signature of a 
modification of the growth of structure, or as a substantial bias of 
the high-z cluster mass estimates. In the following we will address 
this point in more detail. 
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\ [deg-] [deg-] [deg'] A^ [deg'] 

Figure 4. Biased CDF as a function of survey area. As, with fixed observed masses, M^^^, for all eight high-z (upper row) and low-z (lower row) clusters 
aiTanged from left to right and top to bottom: ACT-CL J0102, SPT-CL J2106, XMMU J2235, XMMU J0044, A2163, A370, RXJ1347 and 1E0657. The grey 
shaded areas show the uncertainty in the mass measurements and the dift'erent symbols denote particular choices of the survey area which are: As = 775 
(triangle) and 2 800 deg" (square) for ACT-CL J0102; As = 2 500deg^ (triangle) for SPT-CL J2106; As = 80 (triangle) , 250 (square) and lOOOOdeg^ (circle) 
for XMMU J2235 and XMMU J0044. And As = 2 500 (triangle) and 27490 deg^ (square) fo r all four low-z sys tems in the lower row. For 1E0657, the red. 
dash-dotted curves in in the lower left-hand panel show the results based on the SZ mass t Williamson et alj201lh and the black, solid lines the results for the 
X-ray mass ^Maughan et al .1201 ll) . 



5.1 Impact of ill-defined survey areas 

For a statistical analysis of high-z clusters, the exact definition of 
the survey area is crucial if conclusions regarding the cosmological 
model have to be drawn. For a given mass of a cluster, a too small 
area will lead to a shift of the most likely maximum mass to smaller 
masses and, hence, amplify a possible discrepancy with respect to 
the fiducial ACDM model. A too large survey area would shift the 
peak of the PDF to the right and therefore alleviate possible tension 
with ACDM. Particularly at high redshifts, it can be difficult to de- 
fine the proper survey area since the cluster of interest might have 
been observed by multiple surveys with different selection func- 
tions and observing strategies. 

Instead of plotting the CDFs and PDFs as a function of mass and 
comparing an observed cluster with it, as presented in Fig. |2l it is 
more meaningful to present the CDF for a fixed observed mass as a 
function of survey area. This is shown in Fig.[3]for SPT-CL J2106 
for both the biased (red, dash-dotted lines) and the unbiased (black, 
solid lines) case. The grey shaded areas represent the range in the 
CDF allowed by the uncertainty in the mass estimates. It can di- 
rectly be seen how the cluster becomes more likely to be found as 
the survey area increases. As long as the survey areas are not clearly 
defined, the position of the curve is the best indicator of how ex- 
treme a cluster is. The further right (larger As) the curve drops to 
zero, the more extreme the cluster is. 

For instance, from Fig.[3]we can infer that if SPT-CL J2106 would 
have been detected in a fictive survey area of 100deg^the assigned 
probability to find this cluster would be less than 1 per cent. How- 
ever, detected in the SPT field of As = 2 500deg^, the probability 
increases already to 22.6 per cent and it would increase further if 
the survey area had been increased without finding a more massive 
object in the redshift range. 



In order to comp are our GEV based analysis with the results of 
Ipolev et al. l l201lh . who did not take the bias discussed in Hotchkis^ 
( 120111) into account, one has to look at the red, dash-dotted line in 
Fig. [3] In doing so and by taking the exact reported mass of SPT- 
CL J2106, we obtain a biased probability of < 7 per cent to find 
such a system in the erroneously a posteriori fixed redshift interval. 
It agrees well with the findings of Foley et al. ( 201 IJ, who report a 
probability of < 5 per cent for such a system b ased on a full like- 
lihood calculation uti lising a MCMC method dVanderlinde et al.l 
l20ld : iLewis & Bridie : 2002) for the WMAPl cosmology. We ob- 
tain a slightly higher probability due to our conservative choice of 
(TiuM, resulting in a larger correction for the Eddington bias towards 
smaller masses. This result shows that GEV could serve as a tool 
for quick analysis of extreme clusters at high redshifts before a full 
likelihood analysis is performed, particularly from the point of view 
that it can easily be adapted to alternative cos mological models , 
like e.g. models with no n-Gaussianity (Chongchitnan & Silkl201ll) 
or quintessence models dWaizmarm et al.l 201 1 ). 



5.2 Comparison of all eight clusters 

In the previous section, we showed that the apparent decrease of 
the existence probability with increasing redshift could very likely 
be explained by the relatively small survey areas at high redshifts. 
In order to study this in more detail, we decided to perform a GEV 
analysis for all eight clusters discussed in Section [3] However, we 
do not correct for the bias discussed in Section|4T|and choose the 
redshift intervals a posteriori to be z e [Zobs> 3.0]. The results of this 
calculation are shown in Fig. |4l again in the form of the CDF as 
function of the survey area, for the four high-z clusters in the upper 
row and for the four low-z ones in the lower row. It is important to 
note that now the CDF can no longer be interpreted as a measure 
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quantile p=95 quantile p=99 quantile p=99.9 




Figure 5. Contour plots for a fixed p = 95, 99 and 99.9 (from left to right) in the plane spanned by the cluster mass and survey area for four dift'erent redshift 
intervals as denoted on each iso-contour. For better comparison the survey areas of the RASS, SPT and XDCP are illustrated by the red, dash-dotted lines. 



of the unbiased probability of tension with ACDM, but only as a 
measure of relative extremeness. 

Even with the biased measure, none of the clusters is particularly in 
tension with ACDM, but they can all be considered as rather per- 
fectly normal and expected systems. When confronting the low-z 
with the high-z systems on the basis of the CDF as a function of 
the survey area, the relative positions of the curves do not give any 
support for the claim that clusters at high-z or more extreme than 
low-z ones. 

If A2163, A370 and RXJ1347 had been observed in a 5Pr-like 
field (triangle) instead of the RASS one (square), the resulting bi- 
ased probabilities of their existence would be similar to the high-z 
ones (especially when taking into consideration that larger crinM 
values lead to smaller Eddington bias corrected masses for the low- 
z systems). And, vice versa, an increase of the survey area for the 
high-z systems would alleviate their low probabilities of existence, 
unless more extreme objects are found in the future. 
By looking to currently known very massive single objects at low 
and high redshifts and analysing their CDFs as function of area, 
one does not find any evidence for trend with redshift that high-z 
systems are more extreme than low-z systems. Therefore there is 
no indication for deviations from the ACDM expectations so far. 
Of course, when drawing conclusions from this fact, one has to be 
careful, since at high-z the sufficiently deep covered survey area is 
only a fraction of the low redshift one. Furthermore, cluster mass 
estimates, particularly at high redshifts, are delicate and possible 
biases at high-z are not yet fully explored. As a verdict it is cer- 
tainly interesting to confront the most massive clusters at different 
redshifts with the expectations from theory once a sufficiently deep 
full-sky cluster survey will be available. 



6 EXTREME QUANTILES 

All the probability ranges stated in the previous section are very 
closely related to the statistical concept of quantiles. The extreme 
quantile function based on GEV is the inverse of the CDF from 
equation l|2j and is found to be given by 



«-fii-[-in(4)r 

„_^l„[_l„(Ji.)], 



for y to, 
for 7 = 0, 



(13) 



A quantile x,, is the value of the random variable for which the 
probability Pr{Xj < Xp] is p or, in our case of interest, it is the mass 
of a cluster such that, in p percent of the observed patches, the 
most massive cluster has a smaller mass than the p-quantile mass. 
Usually Xp is also referred to as return level and it is connected to 
the return period Tp = 1/(1 - p/ 100). 

For the case studied here, it is interesting to compute the lines 
for the four a priori defined redshift intervals (see Tab. O in the 
plane spanned by mass and survey area that would correspond 
to a given fixed quantile p. The results are shown in Fig. [5] for 
p = 95 (left-hand panel), p = 99 (central panel) and for p = 99.9 
(right-hand panel), where the red, dash-dotted lines denote from 
top to bottom the survey areas of the RASS, SPT and XDCP 
surveys. It can be seen that the contours for fixed redshift bins and 
quantile p exhibit a steep increase with mass and they also reflect 
the fact that, in high-z bins, smaller masses are needed to reach a 
given quantile. 

The contour plots in Fig. |5] confirm the results from Sec- 
tion |4j3] in which we found that none of the very massive known 
clusters, neither at high nor at low redshifts, is in tension with 
ACDM. If we define the p = 99.9 quantile as the line where the 
ACDM model is excluded and we assume a iM55-like cluster sur- 
vey, then one would need to find a cluster of Maoom > 1 .4 x lO'"" Mq 
at z > 1.5. For the third redshift bin of I.O < z < 1.5, one would 
require a mass of M2oom > 2.5 x 10" Mq, which is roughly the 
double of the mass of SPT-CL J2I06. For the lowest redshift bins 
of 0.5 < z < 1.0 and 0.0 < z < 0.5, we find Mzoom > 4.5 x 10'^ Mq 
and Mioom > 6.7 x 10'^ Mq, respectively. The existence of the 
latter can a lready nowadays be excluded on the basis of RASS 
and MACS jEbeling etal]|200lh surveys a nd for redshifts below 
z = I PLANCK Jrauber. J. A. et alj|2010h should be capable of 
delivering a definite answer. F or the high redshift e nd with z > 1, 
future missions like E UCLID (Laure iis et Zll201ll) and eROSITA 
dCappelluti et alj [201 1 ) should definitely be able to detect such 
extreme systems if they exist. 

The lesson to be learnt from the exercise above is that GEV 
could offer a quick way for characterising the statistical expecta- 
tions for very massive high-z clusters. But for cosmological model 
testing, a proper knowledge of the survey area is crucial. Of course, 
it is not only the area but also the depth that matters because other 
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surveys covering the same object should ideally be as deep as the 
original survey to be matchable, which usually is a particular prob- 
lem for the very high-z clusters. Furthermore, even if one finds a 
high quantile cluster, let's assume p > 99 in a small survey area, 
one should keep in mind that, when sampling a GEV distribution, 
such systems have to exist if the sample is big enough. Considering 
the current high-z coverage this is all that can currently be done, but 
with future full-sky surveys it might be int erest i ng to try to measure 
the CDF itself as proposed in .Waizmann et alj jlOllh . 



7 SUMMARY AND CONCLUSIONS 

In this work we presented the application of general extreme value 
statistics on very massive single clusters at high and low redshifts. 
After introducing the formalism, we applied these statistics to 
eight very massive clusters at high and low redshifts. On the high 
redshift side, those clusters comprise the ACT detected cluster 
ACT-CL J0102-4915, the SPT cluster SPT-CL J2106-5844 and 
two clusters found by the XMM-Newton Distant Cluster Project 
(XDCP) survey: XMMU J2235. 32557 and XMMU J0044.0-2033. 
For the low redshift systems, we considered A2163, A370, 
RXJ1347-1145 and 1E0657 where the latter was added to the 
study because it is the most massive cluster in the SPT survey. 
By computing the CDFs and PDFs for the individual survey 
set-ups as well as fictitious survey areas, we relate the individual 
systems with the probability to find them as most massive system 
for a given size of the survey area. In order to avoid the bias arising 
from the posterior choice of the redshift interval, as discussed 
in[Hptchkiss (2011), we define the redshift intervals prior to the 
analysis to be: 0.0 < z < 0.5, 0.5 < z < 1.0, 1.0 < z < 1.5 
and 1.5 < z < 3.0. From the aforementioned eight clusters, we 
chose the respective most massive one in each individual bin and 
calculate its existence probability in a ACDM cosmology. 
We find that, in the lowest redshift bin of 0.0 < z < 0.5 and 
the PASS survey area of 27 490deg^, A2163 has an existence 
probability of ~ 57 per cent in a range of ~ (11 - 99) per cent 
reflecting the uncertainty in the mass determination. In the second 
redshift interval of 0.5 < z < 1.0, ACT-CL J0102 has, in the 
combined survey area of ACT and SPT of 2 800 deg^, an existence 
probability of ~ 38 per cent in a range of ~ (10 - 83) per cent. For 
SPT-CL J2 106-5 844, we find ~ 23 per cent in a range of ~ (5 - 67) 
per cent in the interval of 1.0 < z < 1.5 and the SPT survey area 
of 2500deg-. In the highest redshift bin of 1.5 < z < 3.0, we 
find for XMMU J0044.0-2033 in the 80deg- XDCP survey area a 
probability of ~ 14 per cent in a range of ~ (4 - 40) per cent. This 
result, by classifying this system as not in t ension with ACDM at 
all, di ffers significantly from the results of IChongchitnan & Silkl 
( I2OII ) who claim this system to be in extreme tension with ACDM 
even if the full-sky is assumed as survey area. Therefore, none of 
the clusters can be considered to be in tension with ACDM even if 
the upper allowed observed mass is assumed. 
If we neglect the aforementioned bias arising from the posterior 
choice of the redshift interval, as it has been done in mo st of 
the literature in the past years (apart from iHotchkissI ( 1201 ih and 
lHovleetalJi2011bl) ), and fix the redshift interval a posteriori, we 
find for SPT-CL J2106-5844 a probability of < 7 per cent. Thi s 
result is in good agreement with the findings ofl Folev et al.l j201 ih . 
who report a probability of ;£ 5 per cent based on a full likelihood 
analysis. The agreement of the two results advocates GEV as a 
quick tool for analysing the probability of observed high-mass 
clusters before a full likelihood analysis is performed. 



By confronting the results of our survey area based GEV analysis 
for both, low and high redshift clusters, we do not find any 
indication for a trend in redshift in the sense that high-z systems 
are more extreme than low-z ones. By studying the CDFs of the 
observed clusters as a function of the survey area, we show that 
the most likely explanation for the lower existence probabilities 
of current high mass, high-z clusters with respect to low-z ones, is 
the lack of sufficiently deep and large high-z surveys. Thus, based 
on current data, we do not see any tendency for a deviation from 
ACDM for the most massive clusters as a function of redshift. 
Of course, one should also be aware that, apart from the fact that 
the current observational data are not deep and complete enough, 
also possible biases can be present in the cluster mass estimates, 
particularly at high redshifts. 

In addition, we introduced the extreme quantiles and calcu- 
lated the contours corresponding to a fixed p-quantile for a fixed 
redshift interval in the plane spanned by cluster mass and survey 
area. These contours allow one to infer what mass a cluster would 
need to have in order to cause a substantial tension with ACDM. 
We find that a cluster with an existence probability of 1 : 1000 
would have to have at least a mass of Mzoom > 4.5 X 10'^ Mo 
in 0.5 < z ^ 1.0, Mjoom > 2.5 x lO'"' Mg in 1.0 < z < 1.5 and 
Mioom > 1-4 X 10'^ Mq for z > 1.5. The first two correspond to 
twice the mass of the most massive currently known cluster in these 
redshift intervals and the last one corresponds to a cluster similar 
to SPT-CL 12106 at z > 1.5. These very high masses make more 
and more questionable the possibility to find a single cluster with 
the potential to significantly question ACDM. However, ongoing 
and future large area surveys like PLANCK, eROSITA and EU- 
CLID have the capabilities to give a definite answer to this question. 

Thus, the main conclusions that can be drawn from this work 
can be summarised as follows: 

(i) None of the currently known very massive clusters at high 
and low redshifts exhibits tension with ACDM. 

(ii) There is no indication for very high-z clusters being more 
extreme than low-z ones and therefore no indication of deviations 
from the ACDM structure growth or of strong redshift depended 
biases in the cluster mass estimates. 

(iii) Clusters with the potential to significantly question ACDM 
would require substantially higher masses in the different redshift 
regimes than currently known. 

(iv) GEV is a valuable tool to understand the rareness of massive 
galaxy clusters and delivers comparable results to more costly full 
likelihood analyses. 

As a closing word of warning, one should be cautious when trying 
to reject a distribution function and thus a underlying cosmolog- 
ical model by means of a single observed object. Very rare real- 
isations of a random variable do have to exist and by observing 
just one there is no telling whether it stems from the tail of the as- 
sumed PDF or from the peak of another one. For small patches, 
one may solve this dilemma by obser ving many of those an d di- 
rectly reconstruct the underlying CDF jWaizmarm et alj|201 ih : for 
large patches, however, one is limited by cosmic variance. 
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